General Machine Learning Classifiers and Data Fusion Schemes for Efficient Speaker Recognition

Authors

  • Siwar Zribi Boujelbene
  • Dorra Ben Ayed Mezghani
Abstract

Data fusion methods exploit the concepts of diversity and redundancy to improve system performance. Diversity improves performance by incorporating different sources of information; redundancy achieves the same goal by re-using data. These concepts have been widely applied to pattern recognition problems. The basic idea is that if several classifiers can be constructed whose errors are mutually uncorrelated, then performance gains can be obtained through a proper fusion of the classifiers. The contribution of this paper is to study the fusion of several machine learning classifiers and to analyze data fusion schemes for text-independent speaker identification. Feature spaces are defined by combining the Mel-scale Filterbank Cepstrum Coefficients (MFCC) and delta coefficients. Each feature set is modelled using a Gaussian mixture model (GMM), which builds a dictionary of speaker models used later as input for classification. Four popular supervised machine learning classifiers are then considered: the multilayer perceptron (MLP) classifier, the support vector machine (SVM) classifier, the decision tree (DT) classifier and the radial basis function (RBF) network classifier. The scores (outputs) of the classifiers are considered under different fusion scenarios. Results show that the best performance was achieved by fusing the SVM, MLP and DT classifiers, which yielded a speaker identification rate of 94.15%.
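The idea of fusing classifier outputs can be illustrated with a minimal sketch. The abstract does not specify the fusion rules the authors used, so the two rules below (averaging normalized scores and majority voting) are common textbook choices assumed here for illustration. The speaker names, score values and function names are hypothetical and not taken from the paper.

```python
from collections import Counter

# Hypothetical per-speaker scores from three classifiers for one test
# utterance. Names and values are illustrative only, not from the paper.
SPEAKERS = ["spk_a", "spk_b", "spk_c"]
scores = {
    "svm": [0.60, 0.30, 0.10],
    "mlp": [0.30, 0.60, 0.10],  # this classifier disagrees with the others
    "dt": [0.55, 0.35, 0.10],
}


def normalize(vec):
    """Rescale non-negative scores so that they sum to 1."""
    total = sum(vec)
    return [v / total for v in vec]


def sum_rule(score_table):
    """Score-level fusion: average the normalized scores per speaker."""
    normalized = [normalize(v) for v in score_table.values()]
    fused = [sum(col) / len(normalized) for col in zip(*normalized)]
    return SPEAKERS[fused.index(max(fused))], fused


def majority_vote(score_table):
    """Decision-level fusion: each classifier votes for its top speaker."""
    votes = [SPEAKERS[v.index(max(v))] for v in score_table.values()]
    return Counter(votes).most_common(1)[0][0]


def majority_error(p):
    """Error rate of a 3-classifier majority vote, assuming each classifier
    is simply right or wrong and errs independently with probability p
    (the vote fails when at least two of the three are wrong)."""
    return 3 * p**2 * (1 - p) + p**3


fused_label, fused_scores = sum_rule(scores)
voted_label = majority_vote(scores)
print(fused_label, voted_label, majority_error(0.1))
```

Under the independence assumption stated in the code, three classifiers that each err 10% of the time give a majority-vote error of about 2.8%, which is the kind of gain the abstract attributes to mutually uncorrelated errors. Real classifiers are rarely fully independent, so the actual gain is usually smaller.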

Similar resources

Ensemble Classifiers Using Unsupervised Data Selection for Speaker Recognition

This paper presents an approach with ensemble classifiers using unsupervised data selection for speaker recognition. Ensemble learning is a type of machine learning that combines several weak learners to achieve better performance than a single learner. Based on its acoustic characteristics, the speech utterance is divided into several subsets using unsupervised data select...

Considering speech quality in speaker verification fusion

This paper emphasizes the benefits of embedding data categorization within fusion of classifiers for text-independent speaker verification. A selective fusion framework is presented which considers data idiosyncrasies by assigning particular test samples to appropriate fusion schemes. As an extension, incompatible data can be spotted and excluded from inherent classification errors. In addition...

Combination of Feature Selection and Learning Methods for IoT Data Fusion

In this paper, we propose five data fusion schemes for the Internet of Things (IoT) scenario, which are Relief and Perceptron (Re-P), Relief and Genetic Algorithm Particle Swarm Optimization (Re-GAPSO), Genetic Algorithm and Artificial Neural Network (GA-ANN), Rough and Perceptron (Ro-P) and Rough and GAPSO (Ro-GAPSO). All the schemes consist of four stages, including preprocessing the data set ba...

Combining SVM Classifiers for Handwritten Digit Recognition

In this paper, we investigate the advantages and weaknesses of various decision fusion schemes using statistical and rule-based reasoning. The cooperation schemes are applied to two SVM (Support Vector Machine) classifiers performing a classification task on two feature families, referred to as structural and statistical features. The obtained results show that it is difficult to exceed the recogni...

Preliminary investigation of Boltzmann machine classifiers for speaker recognition

We propose a novel generative approach to speaker recognition using Boltzmann machines, a fledgeling non-Gaussian probabilistic framework that is increasingly gaining attention in several machine learning fields. We show how a modified i-vector representation of speech utterances enables the development of several Boltzmann machine architectures for speaker verification and we report some preli...

Publication date: 2011